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(57) Abstract 

Hie present invention relates to nucleotide sequences of the NITI gene and amino acid sequences of its encoded proteins, as well as 
derivatives and analogs thereof. Additionally, the present invention relates to the use of nucleotide sequences of NITI genes and amino 
acid sequences of their encoded proteins, as well as derivatives and analogs thereof and antibodies thereto, as diagnostic and therapeutic 
reagents for the detection and treatment of cancer. The present invention also relates to therapeutic compositions comprising Nitl proteins, 
derivatives or analogs thereof, antibodies thereto, nucleic acids encoding the Nitl proteins, derivatives, or analogs and NITI antisense 
nucleic acids, and vectors containing the NITI coding sequence. 
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NITRILASE HOMOLOGS 



5 FIELD OF THE INVENTION 



The present invention generally relates to the field of oncology and tumor 
suppressor genes, and more particularly to the structure and function of the NIT1 
gene, the structure of its encoded proteins, and the use of NIT1 genes and the NIT1 
10 related genes and their encoded proteins and vectors containing the NIT J coding 
sequence as diagnostic and therapeutic reagents for the detection and treatment of 
cancer. 



15 BACKGROUND OF THE INVENTION 

Introduction 



The present invention relates to nucleotide sequences of the NIT1 gene and 
20 amino acid sequences of its encoded proteins, as well as derivatives and analogs 
thereof. Additionally, the present invention relates to the use of nucleotide 
sequences of NIT1 genes and amino acid sequences of their encoded proteins and 
vectors containing the NIT1 coding sequence, as well as derivatives and analogs 
thereof and antibodies thereto, as diagnostic and therapeutic reagents for the 
25 detection and treatment of cancer. The present invention also relates to therapeutic 
compositions comprising Nitl proteins, derivatives or analogs thereof, antibodies 
thereto, nucleic acids encoding the Nitl proteins, derivatives, or analogs, and M77 
antisense nucleic acids, and vectors containing the NIT1 coding sequence. 



30 
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Approaches to Elucidation and Characterization ofNITl 

The tumor suppressor gene FHIT encompasses the common human 
5 chromosomal fragile site at 3pl4.2 and numerous cancer cell bi-allelic deletions. 
To study Fhit function, Fhit genes in D. melanogaster and C. elegans were cloned 
and characterized. The Fhit genes in both of these organisms code for fusion 
proteins in which the Fhit domain is fused with a novel domain showing homology 
to bacterial and plant nitrilases; the D. melanogaster fusion protein exhibited 
10 diadenosine triphosphate (ApppA) hydrolase activity expected of an authentic Fhit 
homolog. 

In human and mouse, the nitrilase homologs and Fhit are encoded by two 
different genes, FHIT and NIT I, localized on chromosomes 3 and 1 in human, and 
14 and 1 in mouse, respectively. Human and murine NIT1 genes were cloned and 

15 characterized, their exon-intron structure, their patterns of expression, and their 
alternative mRNA processing were determined. 

The tissue specificity of expression of murine FHIT and NITJ genes was 
nearly identical. Typically, fusion proteins with dual or triple enzymatic activities 
have been found to carry out specific steps in a given biochemical or biosynthetic 

20 pathway; Fhit and Nitl, as fusion proteins with dual or triple enzymatic activities, 
likewise collaborate in a biochemical or cellular pathway in mammalian cells. 

Importance of FHIT 

25 The human FHIT gene at chromosome 3pl4.2, spanning the constitutive 

chromosomal fragile site FRA3B, is often altered in the most common forms of 
human cancer and is a tumor suppressor gene. The human FHIT gene is greater 
than one megabase in size encoding an mRNA of 1.1 kilobases and a protein of 
147 amino acids. 

30 The rearrangements most commonly seen are deletions within the gene. 

These deletions, often occurring independently in both alleles and resulting in 
inactivation, have been reported in tumor-derived cell lines and primary tumors of 
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lung, head and neck, stomach, colon, and other organs. In cell lines derived from 
several tumor types, DNA rearrangements in the FHIT locus correlated with RNA 
and/or Fhit protein alterations. 
5 Because the inactivation of the FHIT gene by point mutations has not been 

demonstrated conclusively and because several reports have shown the 
amplification of aberrant-sized FHIT reverse transcription-PCR (RT-PCR) 
products from normal cell RNA, a number of investigators have suggested that the 
FHIT gene may not be a tumor suppressor gene. On the other hand it has been 
10 reported that re-expression of Fhit in lung, stomach and kidney tumor cell lines 
lacking endogenous protein suppressed tumorigenicity in vivo in 4 out of 4 cancer 
cell lines. This suggests that FHIT is indeed a tumor suppressor gene. It is noted 
that a report has suggested that Fhit enzymatic activity is not required for its tumor 
suppressor function. 

15 Fhit protein is a member of the histidine triad (HIT) superfamily of 

nucleotide binding proteins and is similar to the Schizosaccharomyces pombe 
diadenosine tetraphosphate (Ap 4 A) hydrolase. Additionally it has been reported 
that, in vitro, Fhit has diadenosine triphosphate (ApppA) hydrolase enzymatic 
activity. 

20 Neither the in vivo function of Fhit nor the mechanism of its tumor 

suppressor activity is known. Nonetheless, genetic, biochemical and 
crystallographic analysis suggest that the enzyme-substrate complex is the active 
form that signals for tumor suppression. One approach to investigate function is to 
investigate Fhit in model organisms such as Drosophila melanogaster and 

25 Caenorhabditis elegans. 

The present invention involves the isolation and characterization of the 
NIT1 gene in these organisms. Fhit occurs in a fusion protein, Nit-Fhit, in D. 
melanogaster and C. elegans, but FHIT and NIT1 are separate genes in 
mammalian cells. The human and mouse NIT1 genes are members of an 

30 uncharacterized mammalian gene family with homology to bacterial and plant 
nitrilases, enzymes which cleave nitriles and organic amides to the corresponding 
carboxylic acids plus ammonia. 
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SUMMARY OF THE INVENTION 

Accordingly, it is an object of the present invention to purify a NIT1 gene. 
5 It is a further object of the present invention to purify a NIT1 gene, wherein 

the purified gene is a human gene. 

It is an object of the present invention to purify a NIT1 gene, wherein the 
purified gene is a mammalian gene. 

It is an object of the present invention to purify a Nitl protein. 
10 It is another object of the present invention to purify a Nitl protein, 

wherein the purified protein is a human protein. 

It is another object of the present invention to purify a Nitl protein, 
wherein the purified protein is a mammalian protein. 

Yet another aspect of the present invention is a purified protein encoded by 
15 a nucleic acid having a nucleotide sequence consisting of the coding region of 
SEQ ID NO: 1 (Figure 6). 

Another aspect of the present invention is an antibody capable of binding a 
Nitl protein. 

It is another object of the present invention to isolate a nucleic acid of less 
20 than 100 kb, comprising a nucleotide sequence encoding a Nitl protein. 

Another object of the present invention is a pharmaceutical composition 
comprising a therapeutically effective amount of a Nitl protein; and a 
therapeutically acceptable carrier. 

Another object of the present invention is a method of treating or 
25 preventing a disease or disorder in a subject comprising administering to said 
subject a therapeutically effective amount of a molecule that inhibits Nitl function. 

Another aspect of the present invention is a method of treating or 
preventing a disease or disorder in a subject comprising administering to said 
subject a therapeutically effective amount of a molecule that enhances Nitl 
30 function. 

It is yet another aspect of the present invention to diagnose or screen for the 
presence of or a disposition for developing a disease in a subject, comprising 
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detecting one or more mutations in NIT1 DNA, RNA or Nitl protein derived from 
the subject in which the presence of said one or more mutations indicates the 
presence of the disease or disorder or a predisposition for developing the disease or 
5 disorder. 

It is yet another aspect of the present invention to treat a disease or disorder 
with a vector containing the coding segment of the NIT1 gene. 

10 BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1. A sequence comparison of human, murine, D. melanogaster, and C. 
elegans Nitl and Fhit proteins. Identities are shown in black boxes, similarities 
are shown in shaded boxes. For human and mouse FHIT GenBank accession 
1 5 numbers are U46922 and AF047699, respectively. 

Fig. 2. Northern blot analysis of expression of NIT1 and FHIT mRNAs in 
murine and human tissues, as well as in D. melanogaster, and C. elegans. (A) 
Mouse multiple tissues Northern blot. Lanes 1-8: heart, brain, spleen, lung, liver, 
skeletal muscle, kidney, and testis. (Top) Fhit probe; (Middle) Nitl probe; 
20 (Bottom) actin probe. (B) Human blot, NIT1 probe. Lanes 1-8: heart, brain, 
placenta, lung, liver, skeletal muscle, kidney, and pancreas. (C) Lanes 1 and 2: D. 
melanogaster adult, D. melanogaster embryo; D. melanogaster Nit-Fhit probe. 
Lane 3: C. elegans adult; C. elegans Nit-Fhit probe. 

Fig. 3. Genomic organization of human and murine NIT1 genes and D. 
25 melanogaster and C elegans Nit-Fhit genes. (A) Exon-intron structure of the 
genes. (B) Alternative processing of human NIT1 gene. 

Fig. 4. Cleavage of ApppA by D. melanogaster Nit-FhiL At indicated 
times of incubation, samples were spotted on TLC plates with appropriate 
nucleotide standards. 

30 Fig. 5. Analysis of alternative transcripts of humatfMTV by RTJPCR. RT- 

PCR of HeLa RNA was performed with primers in different exons. Lanes 1-6: 
exons 1 and 3 (transcript 2); exons 1C and 3 (transcript 5); exons 1 A and 3 
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(transcripts 3, upper band and 4, lower band): exons 2 and 3 (transcripts 2-4); 
exons 1 and 1C (transcript 5); and exons 1 and 2 (transcript 2). 

Fig. 6. Highly conserved sequence of human, murine, D. melanogaster, 
5 and C elegans NIT1 gene. (SEQ ID NO: 1). 

DETAILED DESCRIPTION 

10 Genomic and cDNA clones 

One million plaques of a mouse genomic library (bacteriophage library 
from strain SVJ129, Stratagene, La Jolla, CA) and one hundred thousand plaques 
of a D. melanogaster genomic library were screened with corresponding cDNA 

15 probes. Clones were purified and DNA was isolated. Sequencing was carried out 
using Perkin Elmer thermal cyclers and ABI 377 automated DNA sequencers. 
DNA pools from a human BAC library (Research Genetics, Huntsville, AL) were 
screened by PCR with NIT1 primers (TCTGAAACTGCAGTCTGACCTCA (SEQ 
ID NO:2) and CAGGCACAGCTCCCCTCACTT (SEQ ID NO:3)) according to 

20 the supplier's protocol. The DNA from the positive clone, 31K11, has been 
isolated using standard procedures and sequenced. Chromosomal localization of 
the human NIT1 gene was determined using a radiation hybrid mapping panel 
(Research Genetics) according to the supplier's protocol and with the same 
primers as above. To map murine Nitl gene, Southern blot analysis of genomic 

25 DNA from progeny of a (AEJ/Gn-a bp H /a bp H x M. spretus)Fl x AEJ/Gn-a bp h / a 
bp h backcross was performed using a full length murine Nitl cDNA probe. This 
probe detected a unique 2.0 kb Dral fragment in AEJ DNA and a unique 0.75 kb 
fragment in M. spretus DNA. Segregation of these fragments were followed in 
180 N2 offspring of the backcross. Additional Mit markers (D1MU34, D1MU35, 

30 and D1MU209) were typed from DNA of 92 mice by using PCR consisting of an 
initial denaturation of 4 minutes at 94°C followed by 40 cycles of 94°C for 30 
seconds, 55°C for 30 seconds and 72°C for 30 seconds. Linkage analysis was 
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performed using the computer program SPRETUS MADNESS: PART DEUX. 
Human and mouseJV777 expressed sequence tag (EST) clones were purchased 
form Research Genetics. The sequences of human and murine NIT1 genes and 
5 cDNAs and D. melanogaster and C. elegans Nit-Fhit cDNAs have been deposited 
in GenBank. 

In situ hybridization 

10 D. melanogaster polytene chromosome spreads were prepared from 

salivary glands of third-instar larvae as described. NitFhit DNA fragments were 
labeled with digoxigenin-ll-dUTP using a random-primed DNA labeling kit 
(Boeringer Mannheim, Indianapolis, IN), and were used as probes for the 
chromosomal in situ hybridization. Hybridization was for 20 hours at 37°C in 

15 hybridization buffer: 50% formamide, 2x standard saline citrate (SSC), 10% 
dextran sulfate, 400 mg/ml salmon sperm DNA. Antidigoxigenin-fluorescein 
antibodies (Boehringer Mannheim) were used for detection of hybridizing regions. 
DNA was counterstained with Hoechst 33258 (Sigma, St. Louis, MO). The slides 
were analyzed by fluorescence microscopy. For in situ hybridization, embryos 

20 were fixed and processed as described previously, except that single-stranded 
RNA probes were used. Full length NitFhit cDNA was cloned into Bluescriptll 
KS+ vector and used to synthesize antisense RNA probes with the Genius 4 kit 
(Boehringer Mannheim). 

25 RT-PCR Northern and RACE analysis 

Human and mouse multiple tissue northern blots (Clontech, Palo Alto, CA) 
were hybridized with corresponding NIT1 cDNA probes and washed using the 
supplier's protocol. For the HeLa cell line, total RNA was isolated from 1-5 x 10 8 
30 cells using Trizol reagent (Gibco BRL, Gaithersburg, MD). Z). melanogaster 
PolyA+ RNA was purchased from Clontech. Three \ig of polyA+ RNA or 15 jig 
of total RNA were electrophoresed in 0.8% agarose in a borate buffer containing 
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formaldehyde, transferred to HybondN+ membrane (Amersham, Arlington 
Heights, IL) using standard procedures and hybridized as described above. For 
RT-PCR, 200 ng of polyA+ RNA or 3 ng of total RNA were treated with DNasel 

5 (amplification grade, Gibco BRL) following the manufacturer's protocol DNase- 
treated RNA was used in reverse transcription (RT) reactions as follows: 10 nM 
each dNTP, 100 pmoles random hexamers (oligo (dT) priming was used in some 
cases), DNasel treated RNA, and 200 units of murine leukemia virus (MuLV) 
reverse transcriptase (Gibco BRL), in total volume of 20 \i\ were incubated at 

10 42°C for 1 hour followed by the addition of 10 ng RNase A and incubation at 
37°C for 30 min. One |il of the reaction was used for each PCR reaction. PCR 
reactions were carried out under standard conditions using 10 pmoles of each 
gene-specific primer and 25-35 cycles of 95° 30", 55-60° 30", 72° 1\ Products 
were separated on 1.5% agarose gels and sometimes isolated and sequenced or 

15 cloned and sequenced. Oligo (dT)-primed double-stranded cDNA was synthesized 
by using procedures and reagents from the Marathon RACE cDNA amplification 
kit (Clontech); the cDNA was ligated to Marathon adapters (Clontech). 3* and 5' 
RACE products were generated by long PCR using gene-specific primers and the 
API primer (Clontech). To increase the specificity of the procedure, the second 

20 PCR reaction was carried out by using nested gene-specific primers and the AP2 
primer (Clontech). PCR reactions were performed according to the Marathon 
protocol using the Expand long template PCR system (Boehringer Mannheim) and 
30 cycles of: 94° 30", 60° 30", 68° 4*. RACE products were electrophoreses 
identified by hybridization and sequenced. Degenerate FHIT primers were: 

25 GTNGTNCCNGGNCAYGTNGT (SEQ ID NO:4) and 
ACRTGNACRTGYTTNACNGTYTGNGC (SEQ ID NO:5). D. Melanogaster 
Fhit RACE and RT-PCR primers were: GCGCCTTTGTGGCCTCGACTG (SEQ 
ID NO:6) and CGGTGGCGGAAGTTGTCTGGT (SEQ ID NO:7). C elegans 
Fhit RACE and RT-PCR primers were: GTGGCGGCTGCTCAAACTGG (SEQ 

30 ID NO:8) and TCGCGACGATGAACAAGTCGG (SEQ ID NO:9). Human NIT1 
RT-PCR primers were: GCCCTCCGGATCGGACCCT (SEQ ID NO: 10) (exon 
1); GACCTACTCCCTATCCCGTC (SEQ ID NO:l 1) (exon la); 
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GCTGCGAAGTGCACAGCTAAG (SEQ ID NO: 12) and 

AAACTGAAGCCTCTTTCCTCTGAC (SEQ ID NO: 13) (exon 1c); 

TGGGCTTCATCACCAGGCCT (SEQ ID NO: 14) and 

5 CTGGGCTGAGCACAAAGTACTG (SEQ ID NO:15) (exon 2); 
GCTTGTCTGGCGTCGATGTTA (SEQ ID NO: 16) (exon 3). 

Protein expression and enzymatic characterization 

10 The NIT-FHIT cDNA was amplified with primers 

TGACGTCGACATATGTCAACTCTAGTTAATACCACG (SEQ ID NO: 17) and 
TGGGTACCTCGACTAGCTTATGTCC (SEQ ID NO: 18), digested with Ndel 
and Kpnl, and cloned into plasmid pSGA02 as a Nde\-Kpn\ fragment. 
Escherichia coli strain SGI 00 transformants were grown in Luria-Bertani with 

15 100 jig/ml of ampicillin and 15 ng/ml of chloramphenicol at 15°C. When the 
culture reached an optical density (600 nm) of 0.25, isopropyl B-D-thiogalactoside 
was added to a final concentration of 200 |iM. NitFhit protein was purified from 
inclusion bodies as described. Briefly, the cell pellet from a 1 -liter culture was 
resuspended in 50 ml of 20 mM Tris-HCl (pH 7.5), 20% sucrose, ImM EDTA and 

20 repelleted. Outer cell walls were lysed by resuspension in ice-water. Spheroblasts 
were pelleted, resuspended in 140 mM NaCl, 2.7 mM KC1, 12 mM Na«P04 (pH 
7.3), 5mM EDTA, 500mM phenylmethylsulfonyl fluoride, 1 ng/ml leupeptin and 
20 jig/ml of aprotinin, and sonicated. The resulting inclusion body preparation 
was washed and solubilized in 5 M guanidinium hydrochloride, 50mM Tris»HCl 

25 (pH 8.0), 5mM EDTA. Soluble NitFhit protein was added dropwise to 250ml of 
50mM Tris*HCl (pH 8.0), ImM DTT, 20% glycerol at 40°C. After a 14 hour 
incubation, the 13-kg supernatant was concentrated 100-fold with a Centricon 
filter. A 1 -liter culture yielded approximately 200 jig of partially purified, soluble 
NitFhit. ApppA hydrolase activity was assayed at 30°C in 20 \i\ of 50mM 

30 Na»HEPES pH 7.5, 10% glycerol, 0.5 mM MnC12, 4mM ApppA, 1 nM NitFhit. 
TLC plates were developed as described. 
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Cloning and characterization of D. melanogaster and C elegans 

Fhit homolops 

5 To obtain D. melanogaster Fhit sequences, degenerate primers were 

designed in the conserved regions of exons 5 and 7 of human FHIT. RT-PCR 
experiments with these primers and D. melanogaster RNA resulted in an -200 bp 
product, which when translated showed -50% identity to human Fhit protein. This 
sequence was used to design specific D. melanogaster Fhit primers. 5' and 3* 

10 RACE with these primers resulted in -1.5 kb full length cDNA (including 
polyadenylation signal and Poly(A) tail) encoding a 460 amino acid protein with a 
145 amino acid C-terminal part homologous to human Fhit (40% identity and 47% 
similarity) and a 315 amino acid N-terminal extension (Fig. 1). Northern analysis 
(Fig. 2C) showed a singer band of -1.5 kb in both embryo and adult Z>. 

1 5 melanogaster confirming that the full length cDNA has been cloned. 

The 460 amino acid predicted protein sequence was used in a BLASTP 
search. Of the top 50 scoring alignments, 22 aligned with the 145 residue C- 
terminal segment (Fhit-related sequences) and 28 aligned with the 315 residue N- 
terminal segment. The 28 sequences aligning with the N-terminus were led by an 

20 uncharacterized gene from chromosome X of Saccharomyces cerevisiae (P-value 
of 1.4 x 10" 45 ), followed by uncharacterized ORFs of many bacterial genomes and 
a series of enzymes from plants and bacteria that have been characterized as 
nitrilases and amidases. Thus, the 460 amino acid predicted protein contains an N- 
terminal nitrilase domain and a C-terminal Fhit domain and was designated 

25 NitFhit. 

The D. melanogaster Nit-Fhit cDNA probe was used to screen a D. 
melanogaster lambda genomic library. Sequencing of positive clones revealed 
that the gene is intronless and, interestingly, the 1.5-kb Nit-Fhit gene is localized 
within the 1 ,6-kb intron 1 of the D. melanogaster homolog of the murine glycerol 
30 kinase (Gyk) gene. The direction of transcription of the Nit-Fhit gene is opposite 
to that of the Gyk gene (Fig. 3A). It is not known if such localization affects 
transcriptional regulation of these two genes. 
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The cytological position of the Nit-Fhit gene was determined by in situ 
hybridization to salivary gland polytene chromosomes. These experiments 
showed that there is only one copy of the sequence which was localized to region 
5 61 A, at the tip of the left arm of chromosome 3. Digoxigenin-labeled RNA probes 
were hybridized to whole-mount embryos to determine the pattern of expression 
during development. Nit-Fhit RNA was uniformly expressed throughout the 
embryo suggesting that NitFhit protein could be important for most of the 
embryonic cells. 

10 Because human Fhit protein and the D. melanogaster Fhit domain were 

only 40% identical, to show that the authentic D. melanogaster Fhit homolog was 
cloned, its enzymatic activity was tested. Fig. 4 shows that recombinant D. 
melanogaster NitFhit is capable of cleaving ApppA to AMP and ADP and 
therefore possesses ApppA hydrolase activity. 

15 

C. elegans 

Fhit genomic sequences were obtained from the Sanger database (contig 
Y56A3) by using BLAST searches. 5' and 3 1 RACE with C. elegans Fhit specific 

20 primers yielded a 1.4-kb cDNA (including polyadenylation signal and Poly(A) 
tail) coding for a 440 amino acid protein (Fig. 1). Northern analysis (Fig. 2C) 
showed a single band of a similar size in adult worms. Similarly to D. 
melanogaster, the C. elegans protein contained an N-terminal nitrilase domain and 
a C-terminal Fhit domain (Fig. 1) with 50% identity and 57% similarity to human 

25 Fhit. Comparison between C. elegans Nit-Fhit cDNA and genomic sequences 
from the Sanger database revealed that the C elegans Nit-Fhit gene comprises 8 
exons and is more than 6.5 kb in size (Fig. 3A); the nitrilase domain is encoded by 
exons 1-6, and the Fhit domain is encoded by exons 6-8. D. melanogaster and C. 
elegans NitFhit proteins are 50% identical and 59% similar and exhibit several 

30 conserved domains (Fig. 1). 
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Cloning and characterized of human and murine NIT cDNAs and genes 

Because Fhit and nitrilase domains are part of the same polypeptides in D. 

5 melanogaster and C. elegans, it is reasonable to suggest that they may be involved 
in the same biochemical or cellular pathway(s) in these organisms. Because 
nitrilase homologs are conserved in animals, the mammalian nitrilase homologs 
were cloned as candidate Fhit-interacting proteins. 

To obtain human and murine NIT1 sequences, the Z). melanogaster nitrilase 

10 domain sequence was used in BLAST searches of the GenBank EST database. 
Numerous partially sequenced human and murine NITJ ESTs were found. All 
mouse Nitl ESTs were identical, as were all human NITJ ESTs, suggesting the 
presence of a single NIT1 gene in mouse and human. To obtain the full-length 
human and mouse cDNAs, several human and mouse ESTs and human 5' and 3' 

15 RACE products were completely sequenced. This resulted in the isolation of a 
~1.4-kb full-length human sequence encoding 327 amino acids and a ~1.4-kb 
mouse full-length sequence coding for 323 amino acids (Fig. 1), although several 
alternatively spliced products were detected in both cases (see below and Fig, 3B). 
Both cDNAs are polyadenylated, but lack polyadenylation signals, although AT- 

20 rich regions are present at the very 3' end of each cDNA. Mouse and human Nitl 
amino acid sequences were 90% identical; the human Nitl amino acid sequence 
was 58% similar and 50% identical to the C elegans nitrilase domain and 63% 
similar and 53% identical to the D. melanogaster nitrilase domain (Fig. 1). 

Murine lambda and human BAC genomic libraries were screened with the 

25 corresponding NITJ cDNA probes, yielding one mouse lambda clone and one 
human BAC clone containing the NITJ genes. The human and murine NITJ 
genomic regions were sequenced and compared to the corresponding cDNA 
sequences. The genomic structure of human and mouse NIT1 genes is shown in 
Fig. 3A. Both genes are small: the human gene is -3.2 kb in size and contains 7 

30 exons; the murine gene is -3.6 kb in size and contains 8 exons. Southern analysis 
confirmed that both human and mouse genomes harbor a single NTT1 gene. 
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A radiation hybrid mapping panel (GeneBridge 4) was used to determine 
the chromosomal localization of the human NIT1 gene. By analysis of PGR data at 
the Whitehead/MIT database (http://www-genome.Mn.mit.edu), the NIT1 gene was 
5 localized 6.94 cR from the marker CHLC. G AT A43 A04, which is located at lq21- 
lq22. 

A full length murine Nitl cDNA probe was used to determine the 
chromosomal location of the murine gene by linkage analysis. Interspecific 
backcross analysis of 180 N 2 mice demonstrated that the Nitl locus cosegregated 

10 with several previously mapped loci on distal mouse chromosome 1. The region 
to which Nitl maps was further defined by PCR of genomic DNA from 92 N 2 mice 
using the markers D1MU34, D1MU35 and D1MU209 (Research Genetics). The 
following order of the genes typed in the cross and the ratio of recombinants to N 2 
mice was obtained: centromere - D1MU34 - 7/78 - D1MU35 - 8/90 - Nitl - 1 1/91- 

15 D1MU209 - telomere. The genetic distances given in centiMorgans (±S.E.) are as 
follows: centromere - D1MU209 - 9.0 ± 3.2 - D1MU35 - 8.9 ± 3.0 - Nitl - 12.1 
±3.4 - D1MU209 - telomere. This region of mouse chromosome 1 (lq21 - lq23) 
is syntenic to human chromosome Iq and is consistent with the localization of the 
human ortholog of NitL 

20 

Expression and alternative splicins of human and murine Nitl zenes 

For the human gene, Northern analysis revealed two major transcripts of 
-1.4 kb and -2.4 kb in all adult tissues and tumor cell lines tested. A third band of 

25 -1.2 kb was observed in adult muscle and heart (Fig, 2B). The longest cDNA 
(-1.4 kb) corresponds to the -1.4-kb transcript observed on Northern blots. The 
1.2-kb band corresponds to transcript 1 on Fig, 3B (see below). It is not known if 
the -2.4-kb RNA represents an additional transcript or an incompletely processed 
mRNA. No significant variation in human NIT1 mRNA levels was observed in 

30 different tissues (Fig. 2B). On the contrary, different mouse tissues showed 
different levels of expression of Nitl mRNA (Fig. 2A). The highest levels of Nitl 
mRNA were observed in mouse liver and kidney (Fig. 2A, Middle, lanes 5 and 7). 
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Interestingly, the pattern of Nitl expression was aimost identical to the pattern of 
the expression of Fhit (Fig. 2A, Top and Middle), supporting the hypothesis that 
the proteins may act in concert or participate in the same pathway. 

5 Analysis of mouse Nitl ESTs revealed that some transcripts lack exon 2 

and encode a 323 amino acid protein. An alternative transcript containing exon 2 
encodes a shorter, 290 amino acid protein starting with the methionine 34 (Fig. 1). 

Analysis of human ESTs and 5* RACE products from HeLa and testis also 
suggested alternative processing. To investigate this, a series of RT-PCR 

10 experiments was carried out. Fig. 5 shows the results obtained from HeLa RNA 
(similar results were obtained using RNAs from the MDA-MB-436 breast cancer 
cell line and adult liver). The alternatively spliced transcripts are shown on Fig. 
3B. Transcript 1, lacking exon 2, was represented by several ESTs in the Genbank 
EST database. This transcript probably corresponds to the ~1.2-kb transcript 

15 observed on Northern blots in adult muscle and heart. Transcript 2 encoding the 
327 amino acid Nitl protein (Fig. 1) is a major transcript of human NIT1 at least in 
the cell lines tested. This transcript lacks exons la and lb. Transcript 3 has exon 
la and lb; transcript 4 has exon la but lacks exon lb (Fig. 3B). It is not known if 
transcript 5 (lacking exon 2) starts from exon 1 or 1c. 

20 The alternative initiating methionines of different transcripts are shown on 

Fig. 3B. Data suggest that at least in COS-7 cells transfected with a construct 
containing transcript 2, the methionine in exon 3 (shown in transcripts 1 and 3, 
Fig. 3B) initiates more efficiently than the methionine in exon 2 (Fig. 3B, 
transcript 2). 

25 

Discussion 

Although the frequent loss of Fhit expression in several common human 
cancers is well documented, and results supporting its tumor suppressor activity 
30 have been reported, the role of Fhit in normal and tumor cell biology and its 
mechanism of its action in vivo are unknown. The Ap 3 A hydrolytic activity of 
Fhit seems not to be required for its tumor suppressor function, and it has been 
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suggested that the enzyme-subtract complex is the active form of Fhit. To 
facilitate an investigation of Fhit function, a model organisms approach was 
initiated by cloning and characterization of D. melanogaster and C elegans Fhit 
5 genes. 

Surprisingly, in flies and worms, Fhit is expressed as a fusion protein with 
the Fhit domain fused into a "Nit" domain showing homology to plant and 
bacterial nitrilases. Human and murine NIT] genes were further isolated. Nit and 
Fhit are expressed as separate proteins in mammals but, at the mRNA level, are 

1 0 coordinately expressed in mouse tissues. 

In several eukaryotic biosynthetic pathways multiple steps are catalyzed by 
multifunctional proteins containing two or more enzymatic domains. The same 
steps in prokaryotes frequently are carried out by monoenzymatic proteins that are 
homologs of each domain of the corresponding eukaryotic protein. For example, 

15 Gars, Gart and Airs are domains of the same protein in D. melanogaster and 
mammals. These domains catalyze different steps in de novo synthesis of purines. 
In yeast, Gart homolog (Ade8) is a separate protein and Gars and Airs homologs 
(Ade5 and Ade7) are domains of a bienzymatic protein; in bacteria, all three 
homologs (PurM, PurN and PurD) are separate proteins. De novo pyrimidine 

20 biosynthesis illustrates a similar case. Recently, a fusion protein of a lipoxygenase 
and catalase, both participating in the metabolism of fatty acids, has been 
identified in corals. In all of these examples, if domains of a multienzymatic 
protein in some organisms are expressed as individual proteins in other organisms, 
the individual proteins participate in the same pathways. This observation and the 

25 fact that Fhit and Nitl exhibit almost identical expression patterns in murine 
tissues suggest that Fhit and Nitl participate in the same cellular pathway in 
mammalian cells. 
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WHAT IS CLAIMED IS; 

1 . A purified NIT1 gene. 

5 

2. The gene of claim 1 which is a human gene. 

3. The gene of claim 1 which is a mammalian gene. 
10 4. A purified Nitl protein. 

5. The protein of claim 4 which is a human protein. 

6. A purified protein encoded by a nucleic acid having a 
1 5 nucleotide sequence consisting of the coding region of SEQ ID NO: 1 . 

7. An antibody which is capable of binding a Nitl protein. 

8. The antibody of claim 7 which is monoclonal. 

20 

9. A molecule comprising a fragment of the antibody of claim 
7, which fragment is capable of binding a Nitl protein. 

10. An isolated nucleic acid of less than 100 kb, comprising a 
25 nucleotide sequence encoding a Nitl protein. 

11. The nucleic acid of claim 10 in which the Nitl protein is a 
human Nitl protein. 

30 12, A pharmaceutical composition comprising a therapeutically 

effective amount of a Nitl protein; and a therapeutically acceptable carrier. 
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13. A method of treating or preventing a disease or disorder in a 
subject comprising administering to said subject a therapeutically effective amount 
of a molecule that inhibits Nitl function. 

5 

14. A method of treating or preventing a disease or disorder in a 
subject comprising administering to said subject a therapeutically effective amount 
of a molecule that enhances Nitl function. 

10 15. A method of diagnosing or screening for the presence of or a 

predisposition for developing a disease or disorder in a subject comprising 
detecting one or more mutations in NIT1 DNA, RNA or Nitl protein derived from 
the subject in which the presence of said one or more mutations indicates the 
presence of the disease or disorder or a predisposition for developing the disease or 

15 disorder. 

16. A method of treating or preventing a disease or disorder in a 
subject by using a vector containing the NIT1 gene coding sequence. 

20 
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introns: 



Human NIT1 

ia:198bp 1b;243bp 2:252bp 3: 120bp 4;206bp 5; 154bp 6;266bp 



Y//A 



^ ^ — uzr 



exons: 1. 1a, 1b; >115bp 1c;>110bp 2;96bp 3;255bp 4;104bp 5; 134bp 6;126bp 7: 568&D 
introns: i;256bp 



Murine Nlt1 

2;190bp 3;374bp 4: 132bp 5;243bp 6; 130bp 7;640bp 



-V77T 



Y//A 



~W2i Wk W5* 



Y/MY/M 



exons: l:>65bp 2:124bp 3:87bp 4:252bp 5;97bp 6: 134bp 7: 126bp 8: 553bp 

5;>1,5kb 6;63bp 7; 52bp 



introns: 



C. Qiegans NitFhit 

1;>1.6kb 2:158bp 3: 46bp 4;2046bp 
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D. metanogaster NitFhit 



— 

5;310bp 



— E3 — 

6;225bp 7; 176bp 8: 202bp 



gykexorxZ 1.5 kb Intronless 



B 




M IAAAAA 



M 



M 



M 



I - start of ORFs; ' - major transcript 



Fig. 3 



WO 00/03685 



PCTYUS99/16366 



4/6 




ApppA 



AMP 




Fig. 4 



WO 00/03685 



FCT/US99/16366 




Fig. 5 



WO 00/03685 



6/6 



1 GCCCACTCGC TGCGGCCTNT CTGGCTCCAG 
P L A A A ? LAPD 
61 TTTTGGCTAT ATCTTCATOT A3GACCTACT 
LAI SSC RTYS 
121 TCACCAGGCC TCCTCACAGA TTCCTGTCCC 
TRP PHR FLSL 
181 TCTCAGTACT TTGTGCTCAG CCCAGGCCCA 
SVL CAQ PRPR 
241 AACTGCCCCT GGTGGCTGTG TGCCAGGTAA 
L P L VAV CQVT 
301 AAACATGTGC TGAGCTGGTT CGAGAGGCTG 
TCA ELV REAA 
361 CTGAGGCATT TGACTTCATT GCACGGGACC 
EAF DFI ARDP 
421 TGGGTGGGAA ACTTTTGGAA GAATACACCC 
GGK L L E EYTQ 
481 CCTTGGGTGG TTTCCATGAG CGTGGCCAAG 
LGG FHE RGQD 
S41 GTCACGTGCT GCTGAACAGC AAAGGGGCAG 
H V L LNS KGAV 
601 GTGACGTAGA GATTCCAGGG CAGGGGCCTA 
DVE IPG QGPM 
661 CCAGTCTTGA GTCACCTGTC AGCACACCAG 
SLE SPV STPA 
721 ACATGCGGTT CCCTGAACTC TCTCTGGCAT 
MRF PEL SLAL 
781 ATCCTTCAGC TTTTGGATCC A1TACAGGCC 
PSA F G S l/T G P 
841 GTGCTATCGA AACCCAGTGC TATGTAGTGG 
A I E TQC YVVA 
901 AGAGAGCAAG TTATGGCCAC AGCATGGTGG 
RAS Y G H SMVV 
961 GCTCTGAGGG GCCAGGCCTC TGCCTTGCCC 
SEG PGL CLAR 
1021 GCCGACACCT GCCTGTGTTC CAGCACCGCA 
RHL PVF QHRR 
1081 CACTGTCTTA AGACITGACT TCTGTGAGTT 
LS. DLT SVSL 
1141 ACTATGAGCT AGTGCTCATG TGACTTGGAG 
YELVLM .LGG 
1201 GAGAACCTTG ACTCTCTTGA TGGAACACAG 
EP. LS. WNTD 
1261 GAGCTTCACC TGAGGTCAGA CTGCAGTTTC 
ASP E V R LQFQ 
1321 TATTTCATGG AAACTGAAGT TCTGCTGAGG 
FHG N.S SAEG 
1381 TAATCATAAA GTCAAAAAAA AAAAAAAAAA 
IIK SKK KKKK 
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NED 
[Strard] 

ACCGCCCTCC GGATCGGACC CTGCGAATGG 
RPP DRT LRMV 
CCCTATCCCG TCGGCCGCGG CTGGGCTICA 
LSR RPR LGFI 
TTCTGTGTCC TGGACTCCGG ATACCTCAAC 
LCP GLR IPQL 
GAGCCATGGC TATCTCCTCT TCCTCCTGCG 
A M A ISS SSCE 
CATCGACGCC AGACAAGCAA CAGAACTTTA 
STP DKQ QNFK 
CCAGACTGGG TGCCTGCCTG GCTTTCCTGC 
RLG ACL AFLP 
CTGCAGAGAC GCTACACCTG TCTGAACCAC 
AET LHL SEPL 
AGCTTGCCAG GGAATGTGGA CTCTGGCTGT 
LAR ECG LWLS 
ACTGGGAGCA GACTCAGAAA ATCTACAATT 
WEQ TQK IYNC 
TAGTGGCCAC TTACAGGAAG ACACATCTGT 
VAT YRK THLC 
TGTGTGAAAG CAACTCTACC ATGCCTGGGC 
CES NST MPGP 
CAGGCAAGAT TGGTCTAGCT GTCTGCTATG 
G K I GLA VCYD 
TGGCTCAAGC TGGAGCAGAG ATACTTACCT 
A Q A GAE ILTY 
CAGCCCACTG GGAGGTGTTG CTGCGGGCCC 
AHW EVL LRAR 
CAGCAGCACA GTGTGGACGC CACCATGAGA 
A A Q CGR HHEK 
TAGACCCCTG GGGAACAGTG GTGGCCCGCT 
DPW GTV VARC 
GAATAGACCT CAACTATCTG CGACAGTTGC 
IDL NYL RQLR 
GGCCTGACCT CTATGGCAAT CTGGGTCACC 
PDL YGN LGHP 
TAGACCTGCC CCTCCCACCC CCACCCTGCC 
DLP LPP PPCH 
GCAGGATCCA CCCACAGCTC CCCTCACTTG 
RIQ AQL PSLG 
ATGGGCTGCT TGGGAAAGAA ACTTICACCT 
GLL G K E T'FT. 
AGAAAGGTGG AATTITATAT AGTCATTGTT 
KGG ILY SHCL 
GCTGAGCAGC ACTGGCAITG AAAAATATAA 
.AA LAL KNII 
AAAAAA 
K 
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